Multi-Objective Query Processing for Data Aggregation

نویسندگان

  • Jianchun Fan
  • Subbarao Kambhampati
چکیده

Most fielded data integration systems focus on data aggregation applications, where individual data sources all export fragments of a single relation. Given a query, the primary query processing objective in these systems is that of selecting the appropriate subset of sources so as to optimize various user objectives regarding the completeness and quality of the answers and the response time. In this paper we consider three specific objectives: coverage, density and latency. To handle the often conflicting nature of these objectives, we introduce a joint optimization model for source selection that supports a spectrum of trade-offs between them. We also introduce our techniques in defining query classes for different types of statistics and learning statistics with respect to corresponding query classes from the autonomous data sources. We present comprehensive evaluation to demonstrate the effectiveness of our multi-objective query processing model for data aggregation scenarios.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Query Processing for Data Aggregation : Mining , Using and Maintaining Source Statistics

Most data integration systems focus on “data aggregation” applications, where individual data sources all export fragments of a single relation. Given a query, the primary query processing objective is to select the appropriate subset of sources to optimize conflicting user preferences. We develop an adaptive data aggregation framework to effectively gather and maintain source statistics and us...

متن کامل

Query Planning for Range Queries with User-defined Aggregation on Multi-dimensional Scientific Datasets

Applications that make use of very large scientific datasets have become an increasingly important subset of scientific applications. In these applications, the datasets are often multi-dimensional, i.e., data items are associated with points in a multi-dimensional attribute space. The processing is usually highly stylized, with the basic processing steps consisting of (1) retrieval of a subset...

متن کامل

انتخاب مناسب‌ترین زبان پرس‌وجو برای استفاده از فرا‌‌پیوندها جهت استخراج داده‌ها در حالت دیتالوگ در سامانه پایگاه داده استنتاجی DES

Deductive Database systems are designed based on a logical data model. Data (as opposed to Relational Databases Management System (RDBMS) in which data stored in tables) are saved as facts in a Deductive Database system. Datalog Educational System (DES) is a Deductive Database system that Datalog mode is the default mode in this system. It can extract data to use outer joins with three query la...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Multiresolution Data Aggregation and Analytical Exploration of Large Data Sets

Analytical processing of large relation data demands for a shared compact representation of data in multiple resolutions in order to efficiently facilitate the incurring data aggregation, data cube, and range queries. This paper addresses technical problems of multi-resolution data aggregation and investigates enabling technologies for efficient analytical processing of large data sets. In part...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006